@EA Forum Team

mentions 1 type Person feed RSS

13:56

2026-06-04

forum.effectivealtruism.org

artificial-intelligence

Tamper-Resistance is a Moving Target We Might Not Hit

Open-weight AI models are fundamentally harder to safeguard than closed models because adversaries can extract weights directly and retrain them to bypass embedded protections. In September 2025, the …

// co-occurs with top 3 entities

Evo2 1 Substack 1 NTI 1